gpaligner: a fast algorithm for global pairwise alignment of dna sequences

Authors

mostafa hadian dehkordi

ali masoudi-nejad

morteza mohamad-mouri

abstract

bioinformatics, through the sequencing of the full genomes for many species, is increasingly relying on efficient global alignment tools exhibiting both high sensitivity and specificity. many computational algorithms have been applied for solving the sequence alignment problem. dynamic programming, statistical methods, approximation and heuristic algorithms are the most common methods applied to this problem. we introduce gpaligner, a fast pairwise dna-dna global alignment algorithm. gpaligner uses similar score schema with dialign-t to produce the final alignment. it also uses the concept of “spaced seeds” to determine locally aligned subsequences which construct semi-global alignment as the preliminaries of global alignment computation. this enables gpaligner to have the precision provided by the dialign-t algorithm in considerably less time and space complexities. we performed benchmarking of our approach based on numerous datasets from standard benchmarking databases and real sequences of ncbi database where gpaligner performed three times faster than dialign-t. gpaligner is a new alternative for having sensitivity and selectivity of dialign-t but with less computational cost.

Upgrade to premium to download articles

Sign up to access the full text

Already have an account?login

similar resources

gpALIGNER: A Fast Algorithm for Global Pairwise Alignment of DNA Sequences

Bioinformatics, through the sequencing of the full genomes for many species, is increasingly relying on efficient global alignment tools exhibiting both high sensitivity and specificity. Many computational algorithms have been applied for solving the sequence alignment problem. Dynamic programming, statistical methods, approximation and heuristic algorithms are the most common methods appli...

full text

Minimap2: fast pairwise alignment for long DNA sequences

Motivation: Recent advances in sequencing technologies promise ultra-long reads of ∼100 kilo bases (kb) in average, full-length mRNA or cDNA reads in high throughput and genomic contigs over 100 mega bases (Mb) in length. Existing alignment programs are unable or inefficient to process such data at scale, which presses for the development of new alignment algorithms. Results: Minimap2 is a gene...

full text

a fast algorithm for exonic regions prediction in dna sequences

the main purpose of this paper is to introduce afast method for gene prediction in dna sequences based on the period-3 property in exons. first, the symbolic dna sequences are converted to digital signal using the eiip method. then, to reduce the effect of background noise in the period-3 spectrum, we use the discrete wavelet transform (dwt) at three levels and apply it on the input digital sig...

full text

Net2Align: An Algorithm For Pairwise Global Alignment of Biological Networks

The amount of data on molecular interactions is growing at an enormous pace, whereas the progress of methods for analysing this data is still lacking behind. Particularly, in the area of comparative analysis of biological networks, where one wishes to explore the similarity between two biological networks, this holds a potential problem. In consideration that the functionality primarily runs at...

full text

FOGSAA: Fast Optimal Global Sequence Alignment Algorithm

In this article we propose a Fast Optimal Global Sequence Alignment Algorithm, FOGSAA, which aligns a pair of nucleotide/protein sequences faster than any optimal global alignment method including the widely used Needleman-Wunsch (NW) algorithm. FOGSAA is applicable for all types of sequences, with any scoring scheme, and with or without affine gap penalty. Compared to NW, FOGSAA achieves a tim...

full text

A Fast Algorithm for Exonic Regions Prediction in DNA Sequences

The main purpose of this paper is to introduce a fast method for gene prediction in DNA sequences based on the period-3 property in exons. First, the symbolic DNA sequences were converted to digital signal using the electron ion interaction potential method. Then, to reduce the effect of background noise in the period-3 spectrum, we used the discrete wavelet transform at three levels and applie...

full text

My Resources

Save resource for easier access later


Journal title:
iranian journal of chemistry and chemical engineering (ijcce)

Publisher: iranian institute of research and development in chemical industries (irdci)-acecr

ISSN 1021-9986

volume 30

issue 2 2011

Hosted on Doprax cloud platform doprax.com

copyright © 2015-2023